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APPENDIX 



SEQ ID NO: 1 

5 atg ctg ggc ate tgg acc etc eta cct ctg gtt ctt acg tct gtt get aga tta 



SEQ ID NO: 2 (cecropin pro) 
GCG CCA GAG CCG AAA 

10 

SEQ ID NO: 3 (cecropin pro extended) 

GCG CCA GAG CCG AAA TGG AAA GTC TTC AAG 

SEQ ID NO: 4 (cecropin prepro) 
15 AAT TTC TCA AGG ATA TTT TTC TTC GTG TTC GCT TTG GTT CTG GCT TTG TCA ACA 
GTT TCG GCT GCG CCA GAG CCG AAA 

SEQ ID NO: 5 (cecropin prepro extended) 

AAT TTC TCA AGG ATA TTT TTC TTC GTG TTC GCT TTG GTT CTG GCT TTG TCA ACA 
20 GTT TCG GCT GCG CCA GAG CCG AAA TGG AAA GTC TTC AAG 

SEQ ID NO: 6 (pTnMCS) 

1 ctgacgcgcc ctgtagcggc geattaageg cggcgggtgt ggtggttacg cgcagcgtga 

25 61 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 
121 ccacgttcgc eggcatcaga ttggctattg gecattgeat acgttgtatc catatcataa 
181 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 
241 tagttattaa tagtaatcaa ttacggggtc attagttcat ageccatata tggagttccg 
301 cgttacataa ettaeggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 

30 361 gaegtcaata atgacgtatg ttcccatagt aacgecaata gggactttcc attgaegtea 
421 atgggtggag tatttaeggt aaactgccca cttggcagta catcaagtgt ateatatgee 
4 81 aagtacgccc cctattgacg teaatgaegg taaatggccc gectggcatt atgcccagta 
541 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tegctattae 
601 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcaegggg 

35 661 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 
721 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 
781 acggtgggag gtctatataa gcagagctcg tttagtgaac egtcagateg cctggagacg 
841 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 
901 ggaacggtgc attggaaege ggattccccg tgccaagagt gaegtaagta ccgcctatag 

40 961 actctatagg cacacccctt tggctcttat geatgetata ctgtttttgg ettggggect 
1021 atacaccccc gcttccttat gctataggtg atggtatagc ttagectata ggtgtgggtt 
1081 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 
1141 atggctcttt gccacaacta tctctattgg etatatgeca atactctgtc cttcagagac 
1201 tgacaeggae tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 

45 1261 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 
1321 egaatctegg gtacgtgttc cggacatggg ctcttctccg gtageggegg agcttccaca 
1381 tccgagccct ggtcccatgc ctccagcggc teatggtege tcggcagctc cttgctccta 
14 41 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 
1501 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 

50 1561 gcagatggaa gacttaaggc ageggcagaa gaagatgcag gcagctgagt tgttgtattc 
1621 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 
1681 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 
1741 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 
1801 ttacacgact ctctttacca attctgcccc gaattacact taaaacgact caacagctta 

55 18 61 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 
1921 aacctgccaa ecaaagegag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 
1981 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgetag ctttatctgt 
2041 tegggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 
2101 cttatggtat tgegagctte agtegcacta caeggtegtt ctgttactct ttatgagaaa 

60 2161 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 
2221 gegagcatte taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 
2281 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 
2341 gtacaatatg cagacctagg ageggaaaac tggaaaccta tcagcaactt acatgatatg 
2401 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 
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24 61 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 
2521 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 
2581 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 
2641 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 
5 27 01 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 
27 61 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 
2821 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 
2881 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 
2941 ctactagctc aaaatttatt cacacatggt tacgctttgg ggaaattatg aggggatcgc 

10 3001 tctagagcga tccgggatct cgggaaaagc gttggtgacc aaaggtgcct tttatcatca 
3061 ctttaaaaat aaaaaacaat tactcagtgc ctgttataag cagcaattaa ttatgattga 
3121 tgcctacatc acaacaaaaa ctgatttaac aaatggttgg tctgccttag aaagtatatt 
3181 tgaacattat cttgattata ttattgataa taataaaaac cttatcccta tccaagaagt 
3241 gatgcctatc attggttgga atgaacttga aaaaaattag ccttgaatac attactggta 

15 3301 aggtaaacgc cattgtcagc aaattgatcc aagagaacca acttaaagct ttcctgacgg 
3361 aatgttaatt ctcgttgacc ctgagcactg atgaatcccc taatgatttt ggtaaaaatc 
3421 attaagttaa ggtggataca catcttgtca tatgatcccg gtaatgtgag ttagctcact 
3481 cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg tggaattgtg 
3541 agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa gcgcgcaatt 

20 3601 aaccctcact aaagggaaca aaagctggag ctccaccgcg gtggcggccg ctctagaact 
3661 agtggatccc ccgggctgca ggaattcgat atcaagctta tcgataccgc tgacctcgag 
3721 ggggggcccg gtacccaatt cgccctatag tgagtcgtat tacgcgcgct cactggccgt 
3781 cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 
3841 acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca 

25 3901 acagttgcgc agcctgaatg gcgaatggaa attgtaagcg ttaatatttt gttaaaattc 
3961 gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat cggcaaaatc 
4021 ccttataaat caaaagaata gaccgagata gggttgagtg ttgttccagt ttggaacaag 
4081 agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 
4141 gatggcccac tactccggga tcatatgaca agatgtgtat ccaccttaac ttaatgattt 

30 4201 ttaccaaaat cattagggga ttcatcagtg ctcagggtca acgagaatta acattccgtc 
42 61 aggaaagctt atgatgatga tgtgcttaaa aacttactca atggctggtt atgcatatcg 
4321 caatacatgc gaaaaaccta aaagagcttg ccgataaaaa aggccaattt attgctattt 
4381 accgcggctt tttattgagc ttgaaagata aataaaatag ataggtttta tttgaagcta 
44 41 aatcttcttt atcgtaaaaa atgccctctt gggttatcaa gagggtcatt atatttcgcg 

35 4501 gaataacatc atttggtgac gaaataacta agcacttgtc tcctgtttac tcccctgagc 
4561 ttgaggggtt aacatgaagg tcatcgatag caggataata atacagtaaa acgctaaacc 
4621 aataatccaa atccagccat cccaaattgg tagtgaatga ttataaataa cagcaaacag 
4 681 taatgggcca ataacaccgg ttgcattggt aaggctcacc aataatccct gtaaagcacc 
4741 ttgctgatga ctctttgttt ggatagapat cactccctgt aatgcaggta aagcgatccc 

40 4801 accaccagcc aataaaatta aaacagggaa aactaaccaa ccttcagata taaacgctaa 
48 61 aaaggcaaat gcactactat ctgcaataaa tccgagcagt actgccgttt tttcgcccat 
4 921 ttagtggcta ttcttcctgc cacaaaggct tggaatactg agtgtaaaag accaagaccc 
4981 gtaatgaaaa gccaaccatc atgctattca tcatcacgat ttctgtaata gcaccacacc 
5041 gtgctggatt ggctatcaat gcgctgaaat aataatcaac aaatggcatc gttaaataag 

45 5101 tgatgtatac cgatcagctt ttgttccctt tagtgagggt taattgcgcg cttggcgtaa 
5161 tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 
5221 cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 
52 81 attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 
5341 tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 

50 54 01 ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 
54 61 gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 
5521 ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 
5581 cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 
5641 ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 

55 57 01 accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 
5761 catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 
5821 gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 
5881 tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 
5941 agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 

60 6001 actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 
6061 gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 
6121 aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 
6181 gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 
6241 aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 

65 6301 atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 
6361 gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 
6421 atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 
6481 ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 
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6541 cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 

6601 agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 

6661 cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 

6721 tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 

67 81 agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 

6841 gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 

6901 gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 

6961 ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 

7021 tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 

7081 tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 

7141 gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 

7201 caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 

72 61 atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccac 



SEQ ID NO: 7 (pTnMod) 

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 5 0 
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100 
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150 
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 2 00 
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250 
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300 
CGTTACATAA CTTACGGTAA ATGGCCCGCC HGGCTGACCG CCCAACGACC 350 
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400 
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 4 50 
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500 
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550 
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600 
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650 
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 7 00 
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750 
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 8 00 
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850 
TGTTTTGACC TCCATAGAAG AGACCGGGAC CGATCCAGCC TCCGCGGCCG 900 
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950 
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000 
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050 
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100 
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150 
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 12 00 
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250 
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300 
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350 
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 14 00 
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 14 50 
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500 
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550 
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600 
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650 
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 17 00 
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750 
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 18 00 
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850 
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900 
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950 
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2 000 
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050 
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100 
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150 
TAT GAG AA AG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200 
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250 



3/17 



WO 2005/062881 



PCT/US2004/043092 



TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300 
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350 
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CAT GAT AT GT 2400 
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2 450 
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500 
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550 
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2 600 
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2 650 
GATTGAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 27 00 
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750 
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800 
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2 850 
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900 
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2 950 
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000 
GATCACTTCT GGCTAATAAA AGAT CAGAGC TCTAGAGATC TGTGTGTTGG 3050 
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100 
CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150 
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200 
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAGGGGGA GGATTGGGAA 3250 
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3300 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400 
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450 
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500 
CAT C AC AAC A AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550 
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3 600 
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3 650 
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 37 00 
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750 
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800 
TTAAGGTGGA TACACATCTT GT CAT AT GAT CCCGGTAATG TGAGTTAGCT 3850 
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900 
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 3950 
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4 000 
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGC 4 050 
TGCAGGAATT CGATATCAAG CTTATCGATA CCGCTGACCT CGAGGGGGGG 4100 
CCCGGTACCC AATTCGCCCT ATAGTGAGTC GTATTACGCG CGCTCACTGG 4150 
CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT TACCCAACTT 4200 
AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA ATAGCGAAGA 4250 
GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG AATGGCGAAT 4300 
GGAAATTGTA AGCGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 4 350 
AAATCAGCTC ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 4 400 
AAAT C AAA AG AATAGACCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 4 450 
CAAGAGTCCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 4500 
CCGTCTATCA GGGCGATGGC CCACTACTCC GGGATCATAT GACAAGATGT 4 550 
GTATCCACCT TAACTTAATG ATTTTTACCA AAATCATTAG GGGATTCATC 4 600 
AGTGCTCAGG G TCAACG AG A ATTAACATTC CGTCAGGAAA GCTTATGATG 4 650 
ATGATGTGCT TAAAAACTTA CTCAATGGCT GGTTATGCAT ATCGCAATAC 47 00 
AT GC G AAA AA CCTAAAAGAG CTTGCCGATA AAAAAGGCCA ATTTATTGCT 4750 
ATTTACCGCG GCTTTTTATT GAGCTTGAAA GATAAATAAA ATAGATAGGT 4 8 00 
TTTATTTGAA GCTAAATCTT CTTTATCGTA AAAAATGCCC TCTTGGGTTA 4 8 50 
TCAAGAGGGT CAT TAT AT TT CGCGGAATAA CATCATTTGG TGACGAAATA 4 900 
ACTAAGCACT TGTCTCCTGT TTACTCCCCT GAGCTTGAGG GGTTAACATG 4 950 
AAGGTCATCG ATAGCAGGAT AATAATACAG TAAAACGCTA AACCAATAAT 5000 
CCAAATCCAG CCATCCCAAA TTGGTAGTGA AT GAT TAT AA ATAACAGCAA 5050 
ACAGTAATGG GCCAATAACA CCGGTTGCAT TGGTAAGGCT CACCAATAAT 5100 
CCCTGTAAAG CACCTTGCTG ATGACTCTTT GTTTGGATAG ACATCACTCC 5150 
CTGTAATGCA GGTAAAGCGA TCCCACCACC AGCCAATAAA ATTAAAACAG 5200 
GGAAAACTAA CCAACCTTCA GATATAAACG CTAAAAAGGC AAATGCACTA 5250 
CTATCTGCAA TAAATCCGAG CAGTACTGCC GTTTTTTCGC CCATTTAGTG 5300 
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GCTATTCTTC CTGCCACAAA GGCTTGGAAT ACTGAGTGTA 

ACCCGTAATG AAAAGCCAAC CATCATGCTA TT CAT CATC A 

AATAGCACCA CACCGTGCTG GATTGGCTAT CAATGCGCTG 

CAACAAATGG CATCGTTAAA TAAGTGATGT ATACCGATCA 

5 CCTTTAGTGA GGGTTAATTG CGCGCTTGGC GTAATCATGG 

TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA 

GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA 

ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA 

GCCAGCTGCA TTAATGAATC GGCCAACGCG CGGGGAGAGG 

10 ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 

TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA 
CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC 

CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT 
GCTCCGCCCC CCTGACGAGC AT C AC AAAAA TCGACGCTCA 

15 GGCGAAACCC G AC AG G AC T A TAAAGATACC AGGCGTTTCC 

TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 

CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCATAGC 

GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG 

GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA 

20 TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA 

GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC 
AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 
CGCTCTGCTG AAGCCAGT T A CCTTCGGAAA AAGAGTTGGT 

CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT 

25 CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT 
TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA 

TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT 

TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 

TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC 

30 GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC 

GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC 

CTCACCGGCT CCAGATTTAT CAGCAATAAA CCAGCCAGCC 

AGCGCAGAAG TGGTCCTGCA ACTTTATCCG CCTCCATCCA 

TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA 

35 CGTTGTTGCC ATTGCTACAG GCATCGTGGT GTCACGCTCG 

TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT 

CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC 

CAGAAGTAAG TTGGCCGCAG TGTTATCACT CATGGTTATG 

ATAATTCTCT TACTGTCATG CCATCCGTAA GATGCTTTTC 

40 GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 

CTCTTGCCCG GCGTCAATAC GGGATAATAC CGCGCCACAT 

TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA 

ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC 

CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG 

45 CAGGAAGGCA AAATGCCGCA AAAAAGGGAA TAAGGGCGAC 

TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 

TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG 

AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCAC 

50 SEQ ID NO: 8 (modified Kozak sequence) 
AC CAT G 

SEQ ID NO: 9 (a Kozak sequence) 
ACCATGG 

55 

SEQ ID NO: 10 (a Kozak sequence) 
ACCATGT 

SEQ ID NO: 11 (a Kozak sequence) 
60 AAGATGT 
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AAAGACCAAG 5350 
CGATTTCTGT 54 00 
AAATAATAAT 5450 
GCTTTTGTTC 5500 
TCATAGCTGT ^5550 
CATACGAGCC 5600 
GCTAACTCAC 5 650 
AACCTGTCGT 57 00 
CGGTTTGCGT 5750 
GCTCGGTCGT 5800 
ATACGGTTAT 5850 
AAAAGGCCAG 5900 
TTTTCCATAG 5950 
AGTCAGAGGT 6000 
CCCTGGAAGC 6050 
GATACCTGTC 6100 
TCACGCTGTA 6150 
CTGTGTGCAC 6200 
ACTATCGTCT 6250 
GCAGCCACTG 6300 
AGAGTTCTTG 6350 
TTGGTATCTG 64 00 
AGCTCTTGAT 6450 
TTGCAAGCAG 6500 
TGATCTTTTC 655 0 
GGGATTTTGG 6600 
AAATT AAAAA 6650 
GGTCTGACAG 6700 
TGTCTATTTC 67 50 
TACGATACGG 6800 
GAGACCCACG 6850 
GGAAGGGCCG 6900 
GTCTATTAAT 6950 
GTTTGCGCAA 7000 
TCGTTTGGTA 7 050 
TACATGATCC 7100 
CGATCGTTGT 7150 
GCAGCACTGC 7200 
TGTGACTGGT 7250 
GACCGAGTTG 7300 
AGCAGAACTT 7350 
ACTCTCAAGG 74 00 
GTGCACCCAA 7450 
TGAGCAAAAA 7500 
ACGGAAATGT 7550 
TTTATCAGGG 7 600 
AAAAATAAAC 7 650 

7689 
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SEQ ID NO: 12 (a Kozak sequence) 
ACGATGA 

SEQ ID NO: 13 (a Kozak sequence) 
5 AAGATGG 

SEQ ID NO: 14 (a Kozak sequence) 
GACATGA 

10 SEQ ID NO: 15 (a Kozak sequence) 
ACGATGA 



15 



SEQ ID NO: 16 (a Kozak sequence) 
ACCATGT 



SEQ ID NO: 17 ( conalbumin polyA) 

tctgccattg ctgcttcctc tgcccttcct cgtcactctg aatgtggctt cttcgctact 
gccacagcaa gaaataaaat ctcaacatct aaatgggttt cctgaggttt ttcaagagtc 
gttaagcaca ttccttcccc agcacccctt gctgcaggcc agtgccaggc accaacttgg 
20 ctactgctgc ccatgagaga aatccagttc aatattttcc aaagcaaaat ggattacata 
tgccctagat cctgattaac aggcgtttgt attatctagt gctttcgctt cacccagatt 
atcccattgc ctccc 

SEQ ID NO: 18 (synthetic polyA) 

25 GGCGCCTGGATCCAGATCACTTCTGGCTAA.TAAAAGATCAGAGCTCTAGAGATCTGTGTGTTGGTTTTT 
TGTGGATCTGCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACC 
CTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG 
TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGCACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGG 
CATGCTGGGGATGCGGTGGGCTCTATGGGTACCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC 

30 TCTCGGTACCTCTCTC 

SEQ ID NO: 19 (avian optimized polyA) 

ggggatcgc tctagagcga tccgggatct cgggaaaagc gttggtgacc aaaggtgcct 
tttatcatca ctttaaaaat aaaaaacaat tactcagtgc ctgttataag cagcaattaa 
35 ttatgattga tgcctacatc acaacaaaaa ctgatttaac aaatggttgg tctgccttag 
aaagtatatt tgaacattat cttgattata ttattgataa taataaaaac cttatcccta 
tccaagaagt gatgcctatc attggttgga atgaacttga aaaaaattag ccttgaatac 
attactggta aggtaaacgc cattgtcagc aaattgatcc aagagaacca a 

40 SEQ ID NO: 20 

(vitellogenin promoter) 

TGAATGTGTT CTTGTGTTAT CAATATAAAT CACAGTTAGT GATGAAGTTG GCTGCAAGCC 

TGCATCAGTT CAGCTACTTG GCTGCATTTT GTATTTGGTT CTGTAGGAAA TGCAAAAGGT 

TCTAGGCTGA CCTGCACTTC TATCCCTCTT GCCTTACTGC TGAGAATCTC TGCAGGTTTT 

45 AATTGTTCAC ATTTTGCTCC CATTTACTTT GGAAGATAAA AT AT T TAG AG AATGCTTATG 

AAACCTTTGT TCATTTAAAA ATATTCCTGG TCAGCGTGAC CGGAGCTGAA AGAACACATT 

GATCCCGTGA TTTCAATAAA TACATATGTT CCATATATTG TTTCTCAGTA GCCTCTTAAA 

TCATGTGCGT TGGTGCACAT ATGAATACAT GAATAGCAAA GGTTTATCTG GATTACGCTC 

TGGCCTGCAG GAATGGCCAT AAACCAAAGC TGAGGGAAGA GGGAGAGTAT AGTCAATGTA 

50 GAT TAT ACT G ATTGCTGATT GGGTTATTAT CAGCTAGATA ACAACTTGGG TCAGGTGCCA 

GGTCAACATA ACCTGGGCAA AACCAGTCTC ATCTGTGGCA GGACCATGTA CCAGCAGCCA 

GCCGTGACCC AATCTAGGAA AGCAAGTAGC ACATCAATTT TAAATTTATT GTAAATGCCG 

TAGTAGAAGT GTTTTACTGT GATACATTGA AACTTCTGGT CAATCAGAAA AAGGTTTTTT 

ATCAGAGATG CCAAGGTATT ATTTGATTTT CTTTATTCGC CGTGAAGAGA ATTTATGATT 

55 GCAAAAAGAG GAGTGTTTAC ATAAACTGAT AAAAAACTTG AGGAATTCAG CAGAAAACAG 

CCACGTGTTC CTGAACAT.TC TTCCATAAAA GTCTCACCAT GCCTGGCAGA GCCCTATTCA 
CCTTCGCT 

SEQ ID NO: 21 (fragment of ovalbumin promoter - chicken) 
60 GAGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAG 

AACAATAGCT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG 
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TCCCTCGAAC CAT G AAC AC T 
ATCTGCCAGG CCATTAAGTT 
AGTTCATATC ATAAACACAT 
GAGCTATGTT TTGCTGTATC 
5 ACACCCATAA AAAGATAGAT 
CGTCTGCTCT TCACTCTAGT 
TTATTTCTCC TATTTTGTCA 
TTATGTCCTG CCTAGCATGG 
TCAAATGAAA CAGACTTCTG 

10 ACTAACTAAT AATTGCTAAT 
TTTCTGTTTT CTTAAAGATC 
GAACATGAGC AATATTTCCC 
GATTAGCAGA ACAGGCAGAA 
TATTTGCTCT CCATTCAATC 

15 CCCAATCCCA TTAAATGATT 
GGAACCTGTG GGTGGGTCAC 



CCTCCAGCTG AATTTCACAA TTCCTCTGTC 
ATTCATGGAA GATCTTTGAG GAACACT GCA 
TTGAAATTGA GTATTGTTTT GCATTGTATG 
CTCAGAAAAA AAGTTTGTTA TAAAGCATTC 
TTAAATATTC CAGCTATAGG AAAGAAAGTG 
CTCAGTTGGC TCCTTCACAT GCATGCTTCT 
AGAAAATAAT AGGTCACGTC TTGTTCTCAC 
CTCAGATGCA CGTTGTAGAT ACAAGAAGGA 
GTCTGTTACT ACAACCATAG TAATAAGCAC 
TATGTTTTCC ATCTCTAAGG TTCCCACATT 
CCATTATCTG GTTGTAACTG AAGCTCAATG 
AGTCTTCTCT CCCATCCAAC AGTCCTGATG 
AACACATTGT TACCCAGAAT TAAAAACTAA 
CAAAATGGAC CTATTGAAAC TAAAATCTAA 
TCTATGGCGT CAAAGGTCAA ACTTCTGAAG 
AATTCAGGCT ATATATTCCC CAGGGCTCAG 



SEQ ID NO: 22 (chicken ovalbumin ehancer) 

ccgggctgca gaaaaatgcc aggtggacta tgaactcaca tccaaaggag 

20 cttgacctga tacctgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct 

cagagagaaa ccatcactga tggctacagc accaaggtat gcaatggcaa tccattcgac 

attcatctgt gacctgagca aaatgattta tctctccatg aatggttgct tctttccctc 

atgaaaaggc aatttccaca ctcacaatat gcaacaaaga caaacagaga acaattaatg 

tgctccttcc taatgtcaaa attgtagtgg caaagaggag aacaaaatct caagttctga 

25 gtaggtttta gtgattggat aagaggcttt gacctgtgag ctcacctgga cttcatatcc 

ttttggataa aaagtgcttt tataactttc aggtctccga gtctttattc atgagactgt 

tggtttaggg acagacccac aatgaaatgc ctggcatagg aaagggcagc agagccttag 

ctgacctttt cttgggacaa gcattgtcaa acaatgtgtg acaaaactat ttgtactgct 

ttgcacagct gtgctgggca gggcaatcca ttgccaccta tcccaggtaa ccttccaact 

30 gcaagaagat tgttgcttac tctctctaga 



SEQ ID NO: 23 (5' untranslated region) 

GTGGATCAACATACAGCTAGAAAGCTGTATTGCCTTTAGCACTCAAGCTCAAAAGACAACTCAGAGTTC 
ACC 

35 

SEQ ID NO: 24 (putative cap site) 

ACATACAGCTAG AAAGCTGTAT TGCCTTTAGC ACTCAAGCTC AAAAGACAAC TCAGAGTTCA 



SEQ ID NO: 25 (Chicken Ovalbumin Signal Sequence) 

40 ATG GGCTCCATCG GCGCAGCAAG CATGGAATTT TGTTTTGATG TATTCAAGGA GCTCAAAGTC 
CACCATGCCA AT G AGAAC AT CTTCTACTGC CCCATTGCCA TCATGTCAGC TCTAGCCATG 
GTATACCTGG GTGCAAAAGA CAGCACCAGG ACACAGATAA ATAAGGTTGT TCGCTTTGAT 
AAACTTCCAG GATTCGGAGA CAGTATTGAA GCTCAGTGTG GCACATCTGT AAACGTTCAC 
TCTTCACTTA GAGACATCCT CAACCAAATC ACCAAACCAA ATGATGTTTA TTCGTTCAGC 

45 CTTGCCAGTA GACTTTATGC T G AAG AG AG A TACCCAATCC TGCCAGAATA CTTGCAGTGT 
GTGAAGGAAC TGTATAGAGG AGGCTTGGAA CCTATCAACT TTCAAACAGC TGCAGATCAA 
GCCAGAGAGC TCATCAATTC CTGGGTAGAA AGTCAGACAA ATGGAATTAT CAGAAATGTC 
CTTCAGCCAA GCTCCGTGGA TTCTCAAACT GCAATGGTTC TGGTTAATGC CATTGTCTTC 
AAAGGACTGT GGGAGAAAAC ATTTAAGGAT GAAGACACAC AAGCAATGCC TTTCAGAGTG 

50 ACTGAGCAAG AAAGCAAACC TGTGCAGATG ATGTACCAGA TTGGTTTATT TAG AG T G G C A 
TCAATGGCTT CTGAGAAAAT GAAGATCCTG GAGCTTCCAT TTGCCAGTGG GACAATGAGC 
ATGTTGGTGC TGTTGCCTGA TGAAGTCTCA GGCCTTGAGC AGCTTGAGAG TATAATCAAC 
TTTGAAAAAC TGACTGAATG GACCAGTTCT AATGTTATGG AAGAGAGGAA GATCAAAGTG 
TACTTACCTC GCATGAAGAT GGAGGAAAAA TACAACCTCA CATCTGTCTT AATGGCTATG 

55 GGCATTACTG ACGTGTTTAG CTCTTCAGCC AATCTGTCTG GCATCTCCTC AGCAGAGAGC 
C T G AAG AT AT CTCAAGCTGT CCATGCAGCA CATGCAGAAA TCAATGAAGC AGGCAGAGAG 
GTGGTAGGGT CAGCAGAGGC TGGAGTGGAT GCTGCAAGCG TCTCTGAAGA ATTTAGGGCT 
GACCATCCAT TCCTCTTCTG TATCAAGCAC ATCGCAACCA ACGCCGTTCT CTTCTTTGGC 
AGATGTGTTT CCCCT 

60 

SEQ ID NO: 26 (Chicken Ovalbumin Signal Sequence - shortened 50bp) 
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ATG GGCTCCATCG GCGCAGCAAG CATGGAATTT TGTTTTGATG TATTCAAGGA 

SEQ ID NO: 27 (Chicken Ovalbumin Signal Sequence - shortened lOObp) 
ATG GGCTCCATCG GCGCAGCAAG CATGGAATTT TGTTTTGATG TATTCAAGGA GCTCAAAGTC 
5 CACCATGCCA ATGAGAACAT CTTCTACTGC CCCATTGCCA 



SEQ ID NO: 28 (vitellogenin targeting sequence) 

ATGAGGGGGATCATACTGGCATTAGTGCTCACCCTTGTAGGCAGCCAGAAGTTTGACATTGGT 

10 

SEQ ID NO: 29 (pro-insulin sequence) 

TTTGTGAACCAACACCTGTGCGGCTCACACCTGGTGGAAGCTCTCTACCTAGTGTGCGGGGAACGAGGC 
TTCTTCTACACACCCAAGACCCGCCGGGAGGCAGAGGACCTGCAGGTGGGGCAGGTGGAGCTGGGCGGG 
GGCCCTGGTGCAGGCAGCCTGCAGCCCTTGGCCCTGGAGGGGTCCCTGCAGAAGCGTGGCATTGTGGAA 
15 CAATGCTGTACCAGCATCTGCTCCCTCTACCAGCTGGAGAACTCTGCAACTAG 

SEQ ID NO: 30 (pl46 protein) 
KYKKALKKLAKLL 

20 SEQ ID NO: 31 (pl46 coding sequence) 

AAATACAAAAAAGCACTGAAAAAACTGGCAAAACTGCTG 



SEQ ID NO: 32 (spacer) 
25 (GPGG) X 

SEQ ID NO: 33 (spacer) 
GPGGGPGGGPGG 

30 SEQ ID NO: 34 (spacer) 
GGGGSGGGGSGGGGS 



35 



45 



SEQ ID NO: 35 (spacer) 
GGGGSGGGGSGGGGSGGGGS 

SEQ ID NO: 36 (repeat domain in TAG spacer sequence) 
Pro Ala Asp Asp Ala 



SEQ ID NO: 37 (TAG spacer sequence) 
40 Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala 
Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp 



SEQ ID NO: 38 (gp41 epitope) 

Ala Thr Thr Cys lie Leu Lys Gly Ser Cys Gly Trp lie Gly Leu Leu 

SEQ ID NO: 39 (polynucleotide sequence encoding gp41 epitope) 

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys lie Leu Lys Gly 
Ser Cys Gly Trp lie Gly Leu Leu Asp Asp Asp Asp Lys 

50 SEQ ID NO: 40 (enterokinase cleavage site) 
DDDDK 

SEQ ID NO: 41 (TAG sequence) 

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala 
55 Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys lie 
Leu Lys Gly Ser Cys Gly Trp lie Gly Leu Leu Asp Asp Asp Asp Lys 

SEQ ID NO: 42 (altered transposase Hef forward primer) 
ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC 



8/17 



WO 2005/062881 



PCT/US2004/043092 



SEQ ID NO: 43 (altered transposase Her reverse primer) 
GATTGATCATTATCATAATTTCCCCAAAGCGTAACC 



SEQ ID NO: 44 GnRH:Phor 11 

Met -Glu-His-Trp-Ser-Tyr-Gly-Leu-Arg-Pro-Gly-Lys-Phe-Ala-Ile-Cys-Lys- 
Lys-Phe-Ala-Ile-Cys-OCH 

10 

SEQ ID NO: 45 GNRH/Phor 14 
EHWSYGLR P GKF AKFAKKFAKF AK 



SEQ ID NO: 4 6 Phorl4 : : Beta-LH Sequence 
1 5 MKFAKFAKKFAKFAKS YAVALSCQCALCRR 

SEQ ID NO: 47 (pTnMCS (CMV-prepro-HCPro-ProLys-LC-CPA) ) 

1 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 

61 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 

20 121 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 

181 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 

241 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 

301 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 

361 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 

25 421 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 

481 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 

541 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 

601 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 

661 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 

30 721 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 

781 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 

841 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 

901 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 

961 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 

35 1021 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 

1081 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 

1141 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 

1201 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 

1261 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 

40 1321 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 

1381 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 

1441 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 

1501 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 

1561 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 

45 1621 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 

1681 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 

1741 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 

1801 ttacacgact ctctttacca attctgcccc gaattacact taaaacgact caacagctta 

18 61 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 

50 1921 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 

1981 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 

2041 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 

2101 cttatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 

2161 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 

55 2221 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 

2281 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 

2341 gtacaatatg cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 

2401 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 

2461 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 

60 2521 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 

2581 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 

2641 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 

2701 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 

27 61 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 

65 2821 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 

2881 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 
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2941 ctactagctc aaaatttatt cacacatggt 
3001 tctagagcga tccgggatct cgggaaaagc 
3061 ctttaaaaat aaaaaacaat tactcagtgc 
3121 tgcctacatc acaacaaaaa ctgatttaac 
3181 tgaacattat cttgattata ttattgataa 
3241 gatgcctatc attggttgga atgaacttga 
3301 aggtaaacgc cattgtcagc aaattgatcc 
3361 aatgttaatt ctcgttgacc ctgagcactg 
3421 attaagttaa ggtggataca catcttgtca 
3481 cattaggcac cccaggcttt acactttatg 
3541 agcggataac aatttcacac aggaaacagc 
3601 aaccctcact aaagggaaca aaagctggag 
3661 agtggatccc ccgggctgca ggaattcgat 
3721 catcagattg gctattggcc attgcatacg 
3781 attggctcat gtccaacatt accgccatgt 
3841 taatcaatta cggggtcatt agttcatagc 
3901 acggtaaatg gcccgcctgg ctgaccgccc 
3961 acgtatgttc ccatagtaac gccaataggg 
4 021 ttacggtaaa ctgcccactt ggcagtacat 
4081 attgacgtca atgacggtaa atggcccgcc 
4141 gactttccta cttggcagta catctacgta 
4201 ttttggcagt acatcaatgg gcgtggatag 
4261 caccccattg acgtcaatgg gagtttgttt 
4321 tgtcgtaaca actccgcccc attgacgcaa 
4381 tatataagca gagctcgttt agtgaaccgt 
4441 tttgacctcc atagaagaca ccgggaccga 
4501 ggaacgcgga ttccccgtgc caagagtgac 
4561 acccctttgg ctcttatgca tgctatactg 
4 621 tccttatgct ataggtgatg gtatagctta 
4681 gaccactccc ctattggtga cgatactttc 
4741 acaactatct ctattggcta tatgccaata 
4801 gtatttttac aggatggggt cccatttatt 
4861 tcccccgtgc ccgcagtttt tattaaacat 
4921 cgtgttccgg acatgggctc ttctccggta 
4981 cccatgcctc cagcggctca tggtcgctcg 
5041 gacttaggca cagcacaatg cccaccacca 
5101 ggtatgtgtc tgaaaatgag cgtggagatt 
5161 ttaaggcagc ggcagaagaa gatgcaggca 
5221 aggtaactcc cgttgcggtg ctgttaacgg 
5281 ttgctgccgc gcgcgccacc agacataata 
5341 tgggtctttt ctgcagtcac cgtcggatca 
5401 gtgtttacct atatatctaa atttaatatt 
54 61 atgaatttct caaggatatt tttcttcgtg 
5521 tcggctgcgc cagagccgaa aggtacccag 
5581 gtaaagccgg gggggtccct tagagtctcc 
5641 gcctggatga gctgggtccg ccaggctcca 
5701 aaaagcaaaa ttgatggtgg gacaacagac 
5761 atctcaagag atgattcaaa aaacacgtta 
5821 gacacagccg tatattactg taccacgggg 
5881 cccccgaatt ggggccaggg aaccctggtc 
5941 tcggtcttcc ccctggcacc ctcctccaag 
6001 tgcctggtca aggactactt ccccgaaccg 
6061 accagcggcg tgcacacctt tccggctgtc 
6121 aacgtggtga ccgtgccctc cagcagcttg 
6181 cacaagccca gcaacaccaa ggtggacaag 
6241 cacacatgcc caccgtgccc agcacctgaa 
6301 cccccaaaac ccaaggacac cctcatgatc 
6361 gtggacgtga gccacgaaga ccctgaggtc 
6421 gtgcataatg ccaagacaaa gccgcgggag 
6481 agcgtcctca ccgtcctgca ccaggactgg 
6541 tccaacaaag ccctcccagc ccccatcgag 
6601 cgagaaccac aggtgtacac cctgccccca 
6661 agcctgacct gcctggtcaa aggcttctat 
6721 aatgggcagc cggagaacaa ctacaagacc 
6781 ttcttcctct acagcaagct caccgtggac 
6841 tcatgctccg tgatgcatga ggctctgcac 
6901 tctccgggta aagcgccaga gccgaaaaag 
6961 gtgtcagtgt ccccaggaca aacggccagg 



tacgctttgg ggaaattatg aggggatcgc 
gttggtgacc aaaggtgcct tttatcatca 
ctgttataag cagcaattaa ttatgattga 
aaatggttgg tctgccttag aaagtatatt 
taataaaaac cttatcccta tccaagaagt 
aaaaaattag ccttgaatac attactggta 
aagagaacca acttaaagct ttcctgacgg 
atgaatcccc taatgatttt ggtaaaaatc 
tatgatcccg gtaatgtgag ttagctcact 
cttccggctc gtatgttgtg tggaattgtg 
tatgaccatg attacgccaa gcgcgcaatt 
ctccaccgcg gtggcggccg ctctagaact 
atcaagctta tcgataccgc tgacctcgag 
ttgtatccat atcataatat gtacatttat 
tgacattgat tattgactag ttattaatag 
ccatatatgg agttccgcgt tacataactt 
aacgaccccc gcccattgac gtcaataatg 
actttccatt gacgtcaatg ggtggagtat 
caagtgtatc atatgtcaag tacgccccct 
tggcattatg cccagtacat gaccttatgg 
ttagtcatcg ctattaccat ggtgatgcgg 
cggtttgact cacggggatt tccaagtctt 
tggcaccaaa atcaacggga ctttccaaaa 
atgggcggta ggcgtgtacg gtgggaggtc 
cagatcgcct ggagacgcca tccacgctgt 
tccagcctcc gcggccggga acggtgcatt 
gtaagtaccg cctatagact ctataggcac 
tttttggctt ggggcctafca cacccccgct 
gcctataggt gtgggttatt gaccattatt 
cattactaat ccataacatg gctctttgcc 
ctctgtcctt cagagactga cacggactct 
atttacaaat tcacatatac aacaacgccg 
agcgtgggat ctccacgcga atctcgggta 
gcggcggagc ttccacatcc gagccctggt 
gcagctcctt gctcctaaca gtggaggcca 
ccagtgtgcc gcacaaggcc gtggcggtag 
gggctcgcac ggctgacgca gatggaagac 
gctgagttgt tgtattctga taagagtcag 
tggagggcag tgtagtctga gcagtactcg 
gctgacagac taacagactg ttcctttcca 
atcattcatc tcgtgacttc ttcgtgtgtg 
tcgtttatta aaatttaata tatttcgacg 
ttcgctttgg ttctggcttt gtcaacagtt 
gtgcagctgc aggagtcggg gggaggcttg 
tgtgcagcct ctggattcac tttcagaaac 
gggaaggggc tggagtgggt cggccgtatt 
tatgctgcac ccgtgaaagg cagattcacc 
tatctgcaaa tgaatagcct gaaagccgag 
attatgataa catttggggg agttatccct 
accgtctcct cagcctccac caagggccca 
agcacctctg ggggcacagc ggccctgggc 
gtgacggtgt cgtggaactc aggcgccctg 
ctacagtcct caggactcta cttccttagc 
ggcacccaga cctacatctg caacgtgaat 
aaagttgagc ccaaatcttg tgacaaaact 
ctcctggggg gaccgtcagt cttcctcttc 
tcccggaccc ctgaggtcac atgcgtggtg 
aagttcaact ggtacgtgga cggcgtggag 
gagcagtaca acagcacgta ccgtgtggtc 
ctgaatggca aggagtacaa gtgcaaggtc 
aaaaccatct ccaaagccaa agggcagccc 
tcccgggatg agctgaccaa gaaccaggtc 
cccagcgaca tcgccgtgga gtgggagagc 
acgcctcccg tgctggactc cgacggctcc 
aagagcaggt ggcagcaggg gaacgtcttc 
aaccactaca cgcagaagag cctctccctg 
ctttcctatg agctgacaca gccaccctcg 
atcacctgct ctggagatgc attgccagaa 
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7021 aaatatgttt attggtacca gcagaagtca ggccaggccc ctgtggtggt catctatgag 
7081 gacagcaaac gaccctccgg gatccctgag agattctctg gctccagctc agggacaatg 
7141 gccaccttga ctatcagtgg ggcccaggtg gaagatgaag gtgactacta ctgttactca 
7201 actgacagca gtggttatca tagggaggtg ttcagcggag ggaccaagct gaccgtccta 
5 7261 ggtcagccca aggctgcccc ctcggtcact ctgttcccac cctcctctga ggagcttcaa 

7321 gccaacaagg ccacactggt gtgtctcata agtgactcct acccgggagc cgtgacagtg 
7381 gcctggaagg cagatagcag ccccgtcaag gcgggagtgg agaccaccac accctccaaa 
74 41 caaagcaaca acaagtacgc ggccagcagc tacctgagcc tgacgcttga gcagtggaag 
7501 tcccacaaaa gctacagctg ccaggtcacg catgaaggga gcaccgtgga gaagacagtg 
10 7561 gcccctgcag aatgttcacc gcggagggag ggaagggccc tttttgaagg gggaggaaac 

7621 ttcgcgccat gactcctctc gtgccccccg cacggaacac tgatgtgcag agggccctct 
7681 gccattgctg cttcctctgc ccttcctcgt cactctgaat gtggcttctt tgctactgcc 
7741 acagcaagaa ataaaatctc aacatctaaa tgggtttcct gagatttttc aagagtcgtt 
7801 aagcacattc cttccccagc accccttgct gcaggccagt gccaggcacc aacttggcta 
15 78 61 ctgctgccca tgagagaaat ccagttcaat attttccaaa gcaaaatgga ttacatatgc 

7921 cctagatcct gattaacagg tgttttgtat tatctgtgct ttcgcttcac ccacattatc 
7981 ccattgcctc ccctcgaggg ggggcccggt acccaattcg ccctatagtg agtcgtatta 
8041 cgcgcgctca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca 
8101 acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg 
20 8161 caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggaaat tgtaagcgtt 

8221 aatattttgt taaaattcgc gttaaatttt tgttaaatca gctcattttt taaccaatag 
8281 gccgaaatcg gcaaaatccc ttataaatca aaagaataga ccgagatagg gttgagtgtt 
8341 gttccagttt ggaacaagag tccactatta aagaacgtgg actccaacgt caaagggcga 
8401 aaaaccgtct atcagggcga tggcccacta ctccgggatc atatgacaag atgtgtatcc 
25 8461 accttaactt aatgattttt accaaaatca ttaggggatt catcagtgct cagggtcaac 

8521 gagaattaac attccgtcag gaaagcttat gatgatgatg tgcttaaaaa cttactcaat 
8581 ggctggttat gcatatcgca atacatgcga aaaacctaaa agagcttgcc gataaaaaag 
8641 gccaatttat tgctatttac cgcggctttt tattgagctt gaaagataaa taaaatagat 
8701 aggttttatt tgaagctaaa tcttctttat cgtaaaaaat gccctcttgg gttatcaaga 
30 8761 gggtcattat atttcgcgga ataacatcat ttggtgacga aataactaag cacttgtctc 

8821 ctgtttactc ccctgagctt gaggggttaa catgaaggtc atcgatagca ggataataat 
8881 acagtaaaac gctaaaccaa taatccaaat ccagccatcc caaattggta gtgaatgatt 
8941 ataaataaca gcaaacagta atgggccaat aacaccggtt gcattggtaa ggctcaccaa 
9001 taatccctgt aaagcacctt gctgatgact ctttgtttgg atagacatca ctccctgtaa 
35 9061 tgcaggtaaa gcgatcccac caccagccaa taaaattaaa acagggaaaa ctaaccaacc 

9121 ttcagatata aacgctaaaa aggcaaatgc actactatct gcaataaatc cgagcagtac 
9181 tgccgttttt tcgcccattt agtggctatt cttcctgcca caaaggcttg gaatactgag 
9241 tgtaaaagac caagacccgt aatgaaaagc caaccatcat gctattcatc atcacgattt 
9301 ctgtaatagc accacaccgt gctggattgg ctatcaatgc gctgaaataa taatcaacaa 
40 9361 atggcatcgt taaataagtg atgtataccg atcagctttt gttcccttta gtgagggtta 

9421 attgcgcgct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc 
94 81 acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 
9541 gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 
9601 tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 
45 9661 cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 

9721 gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 
9781 aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 
9841 gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 
9901 aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 
50 9961 gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 

10021 ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 
10081 cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 
10141 ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 
10201 actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 
55 10261 tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 

10321 gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 
10381 ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 
10441 cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 
10501 ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 
60 10561 tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 

10621 agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 
10681 gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 
10741 ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 
10801 gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 
65 10861 cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 

10921 acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 
10981 cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 
11041 cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 
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11101 ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 
11161 tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 
11221 atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 
11281 tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 
11341 actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 
11401 aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 
11461 ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 
11521 ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 
11581 cgaaaagtgc cac 

SEQ ID NO: 48 (pTnMCS (CMV-prepro-HCPro-CPA) ) 

1 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 
61 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 
121 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 
181 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 
241 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 
301 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 
361 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 
421 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 
481 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 
541 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 
601 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 
661 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 
721 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 
781 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggag^acg 
841 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 
901 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 
961 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 
1021 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 
1081 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 
1141 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 
1201 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 
12 61 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 
1321 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttacaca 
1381 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 
1441 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 
1501 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 
1561 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 
1621 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 
1681 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 
1741 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 
1801 ttacacgact ctctttacca attctgcccc gaattacact taaaacgact caacagctta 
1861 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 
1921 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 
1981 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 
2041 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 
2101 cttatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 
2161 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 
2221 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 
2281 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 
2341 gtacaatatg cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 
2401 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 
2461 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 
2521 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 
2581 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 
2 641 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 
2701 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 
2761 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 
2821 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 
2881 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 
2941 ctactagctc aaaatttatt cacacatggt tacgctttgg ggaaattatg aggggatcgc 
3001 tctagagcga tccgggatct cgggaaaagc gttggtgacc aaaggtgcct tttatcatca 
3061 ctttaaaaat aaaaaacaat tactcagtgc ctgttataag cagcaattaa ttatgattga 
3121 tgcctacatc acaacaaaaa ctgatttaac aaatggttgg tctgccttag aaagtatatt 
3181 tgaacattat cttgattata ttattgataa taataaaaac cttatcccta tccaagaagt 
3241 gatgcctatc attggttgga atgaacttga aaaaaattag ccttgaatac attactggta 
3301 aggtaaacgc cattgtcagc aaattgatcc aagagaacca acttaaagct ttcctgacgg 
3361 aatgttaatt ctcgttgacc ctgagcactg atgaatcccc taatgatttt ggtaaaaatc 
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3421 attaagttaa ggtggataca catcttgtca 
3481 cattaggcac cccaggcttt acactttatg 
3541 agcggataac aatttcacac aggaaacagc 
3601 aaccctcact aaagggaaca aaagctggag 
3661 agtggatccc ccgggctgca ggaattcgat 
3721 catcagattg gctattggcc attgcatacg 
3781 attggctcat gtccaacatt accgccatgt 
3841 taatcaatta cggggtcatt agttcatagc 
3901 acggtaaatg gcccgcctgg ctgaccgccc 
3961 acgtatgttc ccatagtaac gccaataggg 
4 021 ttacggtaaa ctgcccactt ggcagtacat 
4081 attgacgtca atgacggtaa atggcccgcc 
4141 gactttccta cttggcagta catctacgta 
4201 ttttggcagt acatcaatgg gcgtggatag 
4261 caccccattg acgtcaatgg gagtttgttt 
4321 tgtcgtaaca actccgcccc attgacgcaa 
4381 tatataagca gagctcgttt agtgaaccgt 
4441 tttgacctcc atagaagaca ccgggaccga 
4 501 ggaacgcgga ttccccgtgc caagagtgac 
4561 acccctttgg ctcttatgca tgctatactg 
4 621 tccttatgct ataggtgatg gtatagctta 
4 681 gaccactccc ctattggtga cgatactttc 
4741 acaactatct ctattggcta tatgccaata 
4801 gtatttttac aggatggggt cccatttatt 
4861 tcccccgtgc ccgcagtttt tattaaacat 
4921 cgtgttccgg acatgggctc ttctccggta 
4981 cccatgcctc cagcggctca tggtcgctcg 
5041 gacttaggca cagcacaatg cccaccacca 
5101 ggtatgtgtc tgaaaatgag cgtggagatt 
5161 ttaaggcagc ggcagaagaa gatgcaggca 
5221 aggtaactcc cgttgcggtg ctgttaacgg 
5281 ttgctgccgc gcgcgccacc agacataata 
5341 tgggtctttt ctgcagtcac cgtcggatca 
5401 gtgtttacct atatatctaa atttaatatt 
5461 atgaatttct caaggatatt tttcttcgtg 
5521 tcggctgcgc cagagccgaa aggtacccag 
5581 gtaaagccgg gggggtccct tagagtctcc 
5641 gcctggatga gctgggtccg ccaggctcca 
5701 aaaagcaaaa ttgatggtgg gacaacagac 
5761 atctcaagag atgattcaaa aaacacgtta 
5821 gacacagccg tatattactg taccacgggg 
5881 cccccgaatt ggggccaggg aaccctggtc 
5941 tcggtcttcc ccctggcacc ctcctccaag 
6001 tgcctggtca aggactactt ccccgaaccg 
6061 accagcggcg tgcacacctt tccggctgtc 
6121 aacgtggtga ccgtgccctc cagcagcttg 
6181 cacaagccca gcaacaccaa ggtggacaag 
6241 cacacatgcc caccgtgccc agcacctgaa 
6301 cccccaaaac ccaaggacac cctcatgatc 
6361 gtggacgtga gccacgaaga ccctgaggtc 
6421 gtgcataatg ccaagacaaa gccgcgggag 
6481 agcgtcctca ccgtcctgca ccaggactgg 
6541 tccaacaaag ccctcccagc ccccatcgag 
6601 cgagaaccac aggtgtacac cctgccccca 
6661 agcctgacct gcctggtcaa aggcttctat 
6721 aatgggcagc cggagaacaa ctacaagacc 
6781 ttcttcctct acagcaagct caccgtggac 
6841 tcatgctccg tgatgcatga ggctctgcac 
6901 tctccgggta aagcgccaga gccgaagctt 
6961 tcagtgtccc caggacaaac ggccaggatc 
7021 tatgtttatt ggtaccagca gaagtcaggc 
7081 agcaaacgac cctccgggat ccctgagaga 
7141 accttgacta tcagtggggc ccaggtggaa 
7201 gacagcagtg gttatcatag ggaggtgttc 
7261 cagcccaagg ctgccccctc ggtcactctg 
7321 aacaaggcca cactggtgtg tctcataagt 
7381 tggaaggcag atagcagccc cgtcaaggcg 
7441 agcaacaaca agtacgcggc cagcagctac 



tatgatcccg gtaatgtgag ttagctcact 
cttccggctc gtatgttgtg tggaattgtg 
tatgaccatg attacgccaa gcgcgcaatt 
ctccaccgcg gtggcggccg ctctagaact 
atcaagctta tcgataccgc tgacctcgag 
ttgtatccat atcataatat gtacatttat 
tgacattgat tattgactag ttattaatag 
ccatatatgg agttccgcgt tacataactt 
aacgaccccc gcccattgac gtcaataatg 
actttccatt gacgtcaatg ggtggagtat 
caagtgtatc atatgtcaag tacgccccct 
tggcattatg cccagtacat gaccttatgg 
ttagtcatcg ctattaccat ggtgatgcgg 
cggtttgact cacggggatt tccaagtctt 
tggcaccaaa atcaacggga ctttccaaaa 
atgggcggta ggcgtgtacg gtgggaggtc 
cagatcgcct ggagacgcca tccacgctgt 
tccagcctcc gcggccggga acggtgcatt 
gtaagtaccg cctatagact ctataggcac 
tttttggctt ggggcctata cacccccgct 
gcctataggt gtgggttatt gaccattatt 
cattactaat ccataacatg gctctttgcc 
ctctgtcctt cagagactga cacggactct 
atttacaaat tcacatatac aacaacgccg 
agcgtgggat ctccacgcga atctcgggta 
gcggcggagc ttccacatcc gagccctggt 
gcagctcctt gctcctaaca gtggaggcca 
ccagtgtgcc gcacaaggcc gtggcggtag 
gggctcgcac ggctgacgca gatggaagac 
gctgagttgt tgtattctga taagagtcag 
tggagggcag tgtagtctga gcagtactcg 
gctgacagac taacagactg ttcctttcca 
atcattcatc tcgtgacttc ttcgtgtgtg 
tcgtttatta aaatttaata tatttcgacg 
ttcgctttgg ttctggcttt gtcaacagtt 
gtgcagctgc aggagtcggg gggaggcttg 
tgtgcagcct ctggattcac tttcagaaac 
gggaaggggc tggagtgggt cggccgtatt 
tatgctgcac ccgtgaaagg cagattcacc 
tatctgcaaa tgaatagcct gaaagccgag 
attatgataa catttggggg agttatccct 
accgtctcct cagcctccac caagggccca 
agcacctctg ggggcacagc ggccctgggc 
gtgacggtgt cgtggaactc aggcgccctg 
ctacagtcct caggactcta cttccttagc 
ggcacccaga cctacatctg caacgtgaat 
aaagttgagc ccaaatcttg tgacaaaact 
ctcctggggg gaccgtcagt cttcctcttc 
tcccggaccc ctgaggtcac atgcgtggtg 
aagttcaact ggtacgtgga cggcgtggag 
gagcagtaca acagcacgta ccgtgtggtc 
ctgaatggca aggagtacaa gtgcaaggtc 
aaaaccatct ccaaagccaa agggcagccc 
tcccgggatg agctgaccaa gaaccaggtc 
cccagcgaca tcgccgtgga gtgggagagc 
acgcctcccg tgctggactc cgacggctcc 
aagagcaggt ggcagcaggg gaacgtcttc 
aaccactaca cgcagaagag cctctccctg 
tcctatgagc tgacacagcc accctcggtg 
acctgctctg gagatgcatt gccagaaaaa 
caggcccctg tggtggtcat ctatgaggac 
ttctctggct ccagctcagg gacaatggcc 
gatgaaggtg actactactg ttactcaact 
agcggaggga ccaagctgac cgtcctaggt 
ttcccaccct cctctgagga gcttcaagcc 
gactcctacc cgggagccgt gacagtggcc 
ggagtggaga ccaccacacc ctccaaacaa 
ctgagcctga cgcttgagca gtggaagtcc 
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7501 cacaaaagct acagctgcca 
7561 cctgcagaat gttcaccgcg 
7 621 gcgccatgac tcctctcgtg 
7 681 attgctgctt cctctgccct 
5 7741 gcaagaaata aaatctcaac 

7801 cacattcctt ccccagcacc 
7861 ctgcccatga gagaaatcca 
7921 agatcctgat taacaggtgt 
7981 ttgcctcccc tcgagggggg 
10 8041 gcgctcactg gccgtcgttt 

8101 taatcgcctt gcagcacatc 
8161 cgatcgccct tcccaacagt 
8221 attttgttaa aattcgcgtt 
8281 gaaatcggca aaatccctta 
15 8341 ccagtttgga acaagagtcc 

8401 accgtctatc agggcgatgg 
8461 ttaacttaat gatttttacc 
8521 aattaacatt ccgtcaggaa 
8581 tggttatgca tatcgcaata 
20 8 641 aatttattgc tatttaccgc 

8701 ttttatttga agctaaatct 
8761 tcattatatt tcgcggaata 
8821 tttactcccc tgagcttgag 
8881 gtaaaacgct aaaccaataa 
25 8941 aataacagca aacagtaatg 

9001 tccctgtaaa gcaccttgct 
9061 aggtaaagcg atcccaccac 
9121 agatataaac gctaaaaagg 
9181 cgttttttcg cccatttagt 
30 9241 aaaagaccaa gacccgtaat 

9301 taatagcacc acaccgtgct 
9361 gcatcgttaa ataagtgatg 
9421 gcgcgcttgg cgtaatcatg 
9481 attccacaca acatacgagc 
35 9541 agctaactca cattaattgc 

9601 tgccagctgc attaatgaat 
9661 tcttccgctt cctcgctcac 
9721 tcagctcact caaaggcggt 
9781 aacatgtgag caaaaggcca 
40 9841 tttttccata ggctccgccc 

9901 tggcgaaacc cgacaggact 
9961 cgctctcctg ttccgaccct 
10021 agcgtggcgc tttctcatag 
10081 tccaagctgg gctgtgtgca 
45 10141 aactatcgtc ttgagtccaa 

10201 ggtaacagga ttagcagagc 
10261 cctaactacg gctacactag 
10321 accttcggaa aaagagttgg 
10381 ggtttttttg tttgcaagca 
50 10441 ttgatctttt ctacggggtc 

10501 gtcatgagat tatcaaaaag 
10561 aaatcaatct aaagtatata 
10621 gaggcaccta tctcagcgat 
10681 gtgtagataa ctacgatacg 
55 10741 cgagacccac gctcaccggc 

10801 gagcgcagaa gtggtcctgc 
10861 gaagctagag taagtagttc 
10921 ggcatcgtgg tgtcacgctc 
10981 tcaaggcgag ttacatgatc 
60 11041 ccgatcgttg tcagaagtaa 

11101 cataattctc ttactgtcat 
11161 accaagtcat tctgagaata 
11221 cgggataata ccgcgccaca 
11281 tcggggcgaa aactctcaag 
65 11341 cgtgcaccca actgatcttc 

11401 acaggaaggc aaaatgccgc 
11461 atactcttcc tttttcaata 
11521 tacatatttg aatgtattta 



ggtcacgcat gaagggagca ccgtggagaa gacagtggcc 
gagggaggga agggcccttt ttgaaggggg aggaaacttc 
ccccccgcac ggaacactga tgtgcagagg gccctctgcc 
tcctcgtcac tctgaatgtg gcttctttgc tactgccaca 
atctaaatgg gtttcctgag atttttcaag agtcgttaag 
ccttgctgca ggccagtgcc aggcaccaac . ttggctactg 
gttcaatatt ttccaaagca aaatggatta catatgccct 
tttgtattat ctgtgctttc gcttcaccca cattatocca 
gcccggtacc caattcgccc tatagtgagt cgtattacgc 
tacaacgtcg tgactgggaa aaccctggcg ttacccaact 
cccctttcgc cagctggcgt aatagcgaag aggcccgcac 
tgcgcagcct gaatggcgaa tggaaattgt aagcgttaat 
aaatttttgt taaatcagct cattttttaa ccaataggcc 
taaatcaaaa gaatagaccg agatagggtt gagtgttgtt 
actattaaag aacgtggact ccaacgtcaa agggcgaaaa 
cccactactc cgggatcata tgacaagatg tgtatccacc 
aaaatcatta ggggattcat cagtgctcag ggtcaacgag 
agcttatgat gatgatgtgc ttaaaaactt actcaatggc 
catgcgaaaa acctaaaaga gcttgccgat aaaaaaggcc 
ggctttttat tgagcttgaa agataaataa aatagatagg 
tctttatcgt aaaaaatgcc ctcttgggtt atcaagaggg 
acatcatttg gtgacgaaat aactaagcac ttgtctcctg 
gggttaacat gaaggtcatc gatagcagga taataataca 
tccaaatcca gccatcccaa attggtagtg aatgattata 
ggccaataac accggttgca ttggtaaggc tcaccaataa 
gatgactctt tgtttggata gacatcactc cctgtaatgc 
cagccaataa aattaaaaca gggaaaacta accaaccttc 
caaatgcact actatctgca ataaatccga gcagtactgc 
ggctattctt cctgccacaa aggcttggaa tactgagtgt 
gaaaagccaa ccatcatgct attcatcatc acgatttctg 
ggattggcta tcaatgcgct gaaataataa tcaacaaatg 
tataccgatc agcttttgtt ccctttagtg agggttaatt 
gtcatagctg tttcctgtgt gaaattgtta tccgctcaca 
cggaagcata aagtgtaaag cctggggtgc ctaatgagtg 
gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg 
cggccaacgc gcggggagag gcggtttgcg tattgggcgc 
tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta 
aatacggtta tccacagaat caggggataa cgcaggaaag 
gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 
ccctgacgag catcacaaaa atcgacgctc aagtcagagg 
ataaagatac caggcgtttc cccctggaag ctccctcgtg 
gccgcttacc ggatacctgt ccgcctttct cccttcggga 
ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc 
cgaacccccc gttcagcccg accgctgcgc cttatccggt 
cccggtaaga cacgacttat cgccactggc agcagccact 
gaggtatgta ggcggtgcta cagagttctt gaagtggtgg 
aaggacagta tttggtatct gcgctctgct gaagccagtt 
tagctcttga tccggcaaac aaaccaccgc tggtagcggt 
gcagattacg cgcagaaaaa aaggatctca agaagatcct 
tgacgctcag tggaacgaaa actcacgtta agggattttg 
gatcttcacc tagatccttt taaattaaaa atgaagtttt 
tgagtaaact tggtctgaca gttaccaatg cttaatcagt 
ctgtctattt cgttcatcca tagttgcctg actccccgtc 
ggagggctta ccatctggcc ccagtgctgc aatgataccg 
tccagattta tcagcaataa accagccagc cggaagggcc 
aactttatcc gcctccatcc agtctattaa ttgttgccgg 
gccagttaat agtttgcgca acgttgttgc cattgctaca 
gtcgtttggt atggcttcat tcagctccgg ttcccaacga 
ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 
gttggccgca gtgttatcac tcatggttat ggcagcactg 
gccatccgta agatgctttt ctgtgactgg tgagtactca 
gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata 
tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 
gatcttaccg ctgttgagat ccagttcgat gtaacccact 
agcatctttt actttcacca gcgtttctgg gtgagcaaaa 
aaaaaaggga ataagggcga cacggaaatg ttgaatactc 
ttattgaagc atttatcagg gttattgtct catgagcgga 
gaaaaataaa caaatagggg ttccgcgcac atttccccga 
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11581 aaagtgccac 



SEQIDNO:49 pTnMCS (Chicken OVep+OVg' +ENT+proins+syn polyA) 
5 1 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 

61 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 

121 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 

181 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 

241 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 

10 301 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 

361 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 

421 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 

481 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 

541 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 

15 601 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 

661 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 

721 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 

781 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 

841 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 

20 901 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 

961 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 

1021 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 

1081 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 

1141 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 

25 1201 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 

1261 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 

1321 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 

1381 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 

1441 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 

30 1501 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 

1561 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 

1621 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 

1681 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 

1741 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 

35 1801 ttacacgact ctctttacca attctgcccc gaattacact taaaacgact caacagctta 

18 61 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 

1921 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 

1981 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 

2041 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 

40 2101 cttatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 

2161 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 

2221 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 

2281 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 

2341 gtacaatatg cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 

45 24 01 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 

2461 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 

2521 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 

2581 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 

2641 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 

50 2701 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 

2761 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 

2821 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 

2881 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 

2941 ctactagctc aaaatttatt cacacatggt tacgctttgg ggaaattatg aggggatcgc 

55 3001 tctagagcga tccgggatct cgggaaaagc gttggtgacc aaaggtgcct tttatcatca 

3061 ctttaaaaat aaaaaacaat tactcagtgc ctgttataag cagcaattaa ttatgattga 

3121 tgcctacatc acaacaaaaa ctgatttaac aaatggttgg tctgccttag aaagtatatt 

3181 tgaacattat cttgattata ttattgataa taataaaaac cttatcccta tccaagaagt 

3241 gatgcctatc attggttgga atgaacttga aaaaaattag ccttgaatac attactggta 

60 3301 aggtaaacgc cattgtcagc aaattgatcc aagagaacca acttaaagct ttcctgacgg 

3361 aatgttaatt ctcgttgacc ctgagcactg atgaatcccc taatgatttt ggtaaaaatc 

3421 attaagttaa ggtggataca catcttgtca tatgatcccg gtaatgtgag ttagctcact 

3481 cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg tggaattgtg 

3541 agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa gcgcgcaatt 

65 3601 aaccctcact aaagggaaca aaagctggag ctccaccgcg gtggcggccg ctctagaact 

3661 agtggatccc ccgggctgca gaaaaatgcc aggtggacta tgaactcaca tccaaaggag 
3721 cttgacctga tacctgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct 
3781 cagagagaaa ccatcactga tggctacagc accaaggtat gcaatggcaa tccattcgac 
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3841 attcatctgt gacctgagca aaatgattta tctctccatg aatggttgct tctttccctc 
3901 atgaaaaggc aatttccaca ctcacaatat gcaacaaaga caaacagaga acaattaatg 
3961 tgctccttcc taatgtcaaa attgtagtgg caaagaggag aacaaaatct caagttctga 
4021 gtaggtttta gtgattggat aagaggcttt gacctgtgag ctcacctgga cttcatatcc 
5 4081 ttttggataa aaagtgcttt tataactttc aggtctccga gtctttattc atgagactcjt 

4141 tggtttaggg acagacccac aatgaaatgc ctggcatagg aaagggcagc agagccttag 
4201 ctgacctttt cttgggacaa gcattgtcaa acaatgtgtg acaaaactat ttgtactgct 
4261 ttgcacagct gtgctgggca gggcaatcca ttgccaccta tcccaggtaa ccttccaact 
4321 gcaagaagat tgttgcttac tctctctaga aagcttctgc agactgacat gcatttcata 

10 4381 ggtagagata acatttactg ggaagcacat ctatcatcat aaaaagcagg caagattttc 

4441 agactttctt agtggctgaa atagaagcaa aagacgtgat taaaaacaaa atgaaacaaa 
4501 aaaaatcagt tgatacctgt ggtgtagaca tccagcaaaa aaatattatt tgcactacca 
4561 tcttgtctta agtcctcaga cttggcaagg agaatgtaga tttctacagt atatatgttt 
4621 tcacaaaagg aaggagagaa acaaaagaaa atggcactga ctaaacttca gctagtgg-ta 

15 4 681 taggaaagta attctgctta acagagattg cagtgatctc tatgtatgtc ctgaagaatt 

4741 atgttgtact tttttccccc atttttaaat caaacagtgc tttacagagg tcagaatggt 
4801 ttctttactg tttgtcaatt ctattatttc aatacagaac aatagcttct ataactgaaa 
4861 tatatttgct attgtatatt atgattgtcc ctcgaaccat gaacactcct ccagctgaat 
4921 ttcacaattc ctctgtcatc tgccaggcca ttaagttatt catggaagat ctttgaggaa 

20 4981 cactgcaagt tcatatcata aacacatttg aaattgagta ttgttttgca ttgtatggag 

5041 ctatgttttg ctgtatcctc agaaaaaaag tttgttataa agcattcaca cccataaaaa 
5101 gatagattta aatattccag ctataggaaa gaaagtgcgt ctgctcttca ctctagtctc 
5161 agttggctcc ttcacatgca tgcttcttta tttctcctat tttgtcaaga aaataatagg 
5221 tcacgtcttg ttctcactta tgtcctgcct agcatggctc agatgcacgt tgtagataca 

25 5281 agaaggatca aatgaaacag acttctggtc tgttactaca accatagtaa taagcaeact 

5341 aactaataat tgctaattat gttttccatc tctaaggttc ccacattttt ctgttttctt 
5401 aaagatccca ttatctggtt gtaactgaag ctcaatggaa catgagcaat atttcccagt 
5461 cttctctccc atccaacagt cctgatggat tagcagaaca ggcagaaaac acattgttac 
5521 ccagaattaa aaactaatat ttgctctcca ttcaatccaa aatggaccta ttgaaactaa 

30 5581 aatctaaccc aatcccatta aatgatttct atggcgtcaa aggtcaaact tctgaaggga 

5641 acctgtgggt gggtcacaat tcaggctata tattccccag ggctcagcca gtggatcaac 
5701 atacagctag aaagctgtat tgcctttagc actcaagctc aaaagacaac tcagagttca 
5761 ccatgggctc catcggcgca gcaagcatgg aattttgttt tgatgtattc aaggagctca 
5821 aagtccacca tgccaatgag aacatcttct actgccccat tgccatcatg tcagctctag 

35 5881 ccatggtata cctgggtgca aaagacagca ccaggacaca gataaataag gttgttcgct 

5941 ttgataaact tccaggattc ggagacagta ttgaagctca gtgtggcaca tctgtaaacg 
6001 ttcactcttc acttagagac atcctcaacc aaatcaccaa accaaatgat gtttattcgt 
6061 tcagccttgc cagtagactt tatgctgaag agagataccc aatcctgcca gaatacttgc 
6121 agtgtgtgaa ggaactgtat agaggaggct tggaacctat caactttcaa acagctgcag 

40 6181 atcaagccag agagctcatc aattcctggg tagaaagtca gacaaatgga attatcagaa 

6241 atgtccttca gccaagctcc gtggattctc aaactgcaat ggttctggtt aatgccattg 
6301 tcttcaaagg actgtgggag aaaacattta aggatgaaga cacacaagca atgccttfcca 
6361 gagtgactga gcaagaaagc aaacctgtgc agatgatgta ccagattggt ttatttagag 
6421 tggcatcaat ggcttctgag aaaatgaaga tcctggagct tccatttgcc agtgggacaa 

45 6481 tgagcatgtt ggtgctgttg cctgatgaag tctcaggcct tgagcagctt gagagtafcaa 

6541 tcaactttga aaaactgact gaatggacca gttctaatgt tatggaagag aggaagatca 
6601 aagtgtactt acctcgcatg aagatggagg aaaaatacaa cctcacatct gtcttaatgg 
6661 ctatgggcat tactgacgtg tttagctctt cagccaatct gtctggcatc tcctcagcag 
6721 agagcctgaa gatatctcaa gctgtccatg cagcacatgc agaaatcaat gaagcaggca 

50 6781 gagaggtggt agggtcagca gaggctggag tggatgctgc aagcgtctct gaagaatfcta 

6841 gggctgacca tccattcctc ttctgtatca agcacatcgc aaccaacgcc gttctcttct 
6901 ttggcagatg tgtttctccg cggccagcag atgacgcacc agcagatgac gcaccagcag 
6961 atgacgcacc agcagatgac gcaccagcag atgacgcacc agcagatgac gcaacaacat 
7021 gtatcctgaa aggctcttgt ggctggatcg gcctgctgga tgacgatgac aaatttgtga 

55 7081 accaacacct gtgcggctca cacctggtgg aagctctcta cctagtgtgc ggggaacgag 

7141 gcttcttcta cacacccaag acccgccggg aggcagagga cctgcaggtg gggcaggtgg 
7201 agctgggcgg gggccctggt gcaggcagcc tgcagccctt ggccctggag gggtccctgc 
7261 agaagcgtgg cattgtggaa caatgctgta ccagcatctg ctccctctac cagctggaga 
7321 actactgcaa ctagggcgcc taaagggcga attatcgcgg ccgctctaga ccaggcgcct 

60 7381 ggatccagat cacttctggc taataaaaga tcagagctct agagatctgt gtgttggttt 

7441 tttgtggatc tgctgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc 
7501 cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg 
7561 catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg cagcacagca 
7 621 agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc tctatgggta 

65 7681 cctctctctc tctctctctc tctctctctc tctctctctc tcggtacctc tctcgagggg 

7741 gggcccggta cccaattcgc cctatagtga gtcgtattac gcgcgctcac tggccgtcgt 
7801 tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca 
7861 tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca 
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7921 gttgcgcagc ctgaatggcg aatggaaatt gtaagcgtta atattttgtt aaaattcgcg 
7981 ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg caaaatccct 
8041 tataaatcaa aagaatagac cgagataggg ttgagtgttg ttccagtttg gaacaagagt 
8101 ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat 
5 8161 ggcccactac tccgggatca tatgacaaga tgtgtatcca ccttaactta atgattttta 

8221 ccaaaatcat taggggattc atcagtgctc agggtcaacg agaattaaca ttccgtcagg 
8281 aaagcttatg atgatgatgt gcttaaaaac ttactcaatg gctggttatg catatcgcaa 
8341 tacatgcgaa aaacctaaaa gagcttgccg ataaaaaagg ccaatttatt gctatttacc 
8401 gcggcttttt attgagcttg aaagataaat aaaatagata ggttttattt gaagctaaat 
10 8461 cttctttatc gtaaaaaatg ccctcttggg ttatcaagag ggtcattata tttcgcggaa 

8521 taacatcatt tggtgacgaa ataactaagc acttgtctcc tgtttactcc cctgagcttg 
8581 aggggttaac atgaaggtca tcgatagcag gataataata cagtaaaacg ctaaaccaat 
8641 aatccaaatc cagccatccc aaattggtag tgaatgatta taaataacag caaacagtaa 
8701 tgggccaata acaccggttg cattggtaag gctcaccaat aatccctgta aagcaccttg 
15 8761 ctgatgactc tttgtttgga tagacatcac tccctgtaat gcaggtaaag cgatcccacc 

8821 accagccaat aaaattaaaa cagggaaaac taaccaacct tcagatataa acgctaaaaa 
8881 ggcaaatgca ctactatctg caataaatcc gagcagtact gccgtttttt cgcccattta 
8941 gtggctattc ttcctgccac aaaggcttgg aatactgagt gtaaaagacc aagacccgta 
9001 atgaaaagcc aaccatcatg ctattcatca tcacgatttc tgtaatagca ccacaccgtg 
20 9061 ctggattggc tatcaatgcg ctgaaataat aatcaacaaa tggcatcgtt aaataagtga 

9121 tgtataccga tcagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgtaatca 
9181 tggtcatagc tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga 
9241 gccggaagca taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt 
9301 gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga 
25 9361 atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 

9421 actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 
9481 gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 
9541 cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 
9601 ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 
30 9661 ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 

9721 ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 
9781 agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 
9841 cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 
9901 aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 
35 9961 gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 

10021 agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 
10081 ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 
10141 cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 
10201 tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 
40 10261 aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 

10321 tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 
10381 atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata 
10441 cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg 
10501 gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct 
45 10561 gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt 

10621 tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc 
10681 tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 
10741 tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 
10801 aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc 
50 10861 atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa 

10921 tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca 
10981 catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 
11041 aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct 
11101 tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 
55 11161 gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 

11221 tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 
11281 tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc ac 
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